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Abstract 

Many nonlinear extensions of the Kalman filter, e.g., the extended and the unscented Kalman filter, reduce 
the state densities to Gaussian densities. This approximation gives sufficient results in many cases. However, 
this filters only estimate states that are correlated with the observation. Therefore, sequential estimation of 
diffusion parameters, e.g., volatility, which are not correlated with the observations is not possible. While 
other filters overcome this problem with simulations, we extend the measurement update of the Gaussian 
two-moment filters by a higher order correlation measurement update. We explicitly state formulas for a 
higher order unscented Kalman filter within a continuous-discrete state space. We demonstrate the filter in 
the context of parameter estimation of an Ornstein-Uhlenbeck process. 

Key words: Sequential Parameter Estimation, Nonlinear Systems, Unscented Kalman Filter, 
Continuous-discrete State Space, Estimation of Uncorrelated States, Volatility Estimation 



1. Introduction 

Stochastic filters are powerful tools for simultaneous estimation of parameters and unoberserved states 
from noisy data. In nonlinear filtering problems, however, in contrast to the classical linear Kalman filter set- 
ting nonlinear transformations of probability densities have to be evaluated. Since these transformations 
cannot be tracked analytically, approximations have to be used to construct nonlinear filters. This paper 
deals with a new approximation based on martingale estimation functions, which enables correlation-based 
nonlinear filters to estimate states which are not directly correlated with observations like, for example, 
stock price volatility. 

A detailed survey of different nonlinear filters and their approximations is given in Q (see also [^). 
Generally, one can distinguish between simulation-based approaches and analytical approximations. While 
simulation-based approaches often lead to more accurate descriptions of the densities, analytical approx- 
imations are superior with respect to computation costs. The most common analytical approximation is 
capturing only the first two moments of the densities, i.e., assuming Gaussian densities. This strongly sim- 
plifies all filter equations and leads to computationally very fast filters. The extended Kalman Filter (see, 
e-g-, 01) and the unscented Kalman Filter (see, e.g., ^) are well-known examples for such Gaussian filters. 

Assuming Gaussian densities means assuming Gaussian dependence. For this reason, these filters are only 
able to estimate states and latent states that are linearly correlated with the observations. However, the cases 
where there is no linear correlation are common and important. In the context of econometric modelling, is 
the estimation of stock price volatility a typical example. Since price observations are not linearly correlated 
with the current volatility parameters, Gaussian filters are not able to sequentially estimate volatility given 
a sequence of price observations. Therefore other filters like simulation-based filters or particle filters have 
to be used (see, e.g., [6]). However, especially in the context of high frequency financial data, computation 
time is a crucial point. Therefore, in many applications the use of simulation-based filters is not possible. 
For these applications, fast stochastic filters are needed which are able to estimate diffusion parameters. 
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In this paper, we extend the filter equations of Gaussian filters by a higher order measurement update. 
This measurement update relies on the theory of martingale estimating functions and uses additional infor- 
mation on the squared prediction error, i.e., the square of the difference between prediction of an observation 
value and the actual observation value. The update of normal Gaussian filters uses only the correlation of 
the prediction error with the states. The correlation of the squared prediction error relies on information 
on higher moments of the densities, that have to be propagated in the time update steps of the filters. This 
can be realized in different ways. We explicitly state an algorithm for a higher order correlation unscented 
Kalman filter. We believe, that an algorithm in the sense of an higher order correlation extended Kalman fil- 
ter is also possible to implement, but relies on quite complicated moment equations which are not necessary 
for the unscented filter. For the unscented filter we use an extended version of the unscented transformation 
to capture the information on higher moments. We demonstrate the applicability of the algorithm in a 
simulation study estimating the parameters of an Ornstein-Uhlenbeck process. 

The paper is organized as follows. In section[21 nonlinear state estimation in the framework of continuous- 
discrete state space models is discussed deriving the general filter equations for Gaussian filters and introduc- 
ing a higher order correlation update. In section [3j higher order filter equations for the unscented Kalman 
filter are derived. Section U] contains a simulation study analyzing bias and variance of the filter estimates 
in the context of an Ornstein-Uhlenbeck model. Section [S] concludes. 

2. Continuous discrete higher order correlation filters 

2.1. State space models and parameter estimation 

We concentrate on continuous discrete state space models (see [4]) as they are very useful in systems, in 
which the underlying models are continuous in time and in which only discrete observations are available. 
Since these models are more general than discrete state space models, the results in this paper also hold for 
the discrete case. 

The continuous discrete state space consists of a continuous state equation for the state y{t) (equation 
([T])) and discrete measurements at times ti,i = 1 . . . n, which do not have to be evenly spaced in time 
(equation 

dy{t) = f{y{t),t,^)dt + g(y(1^,t,-^)dW(1^, (1) 
= h{y{ti),t,) + e,. (2) 

The first equation is a p-dimensional Ito differential equation with an r-dimensional Wiener process W{t). 
The drift coefficient / : x M x M" — > Wp and the diffusion coefficient 5 : Rp x R x M" — ^ Rp x M'' 
are functions of the state, the time and a u-dimensional parameter vector The measurement equation 
projects the state vector y(t) onto the time discrete fc-dimensional measurements z^. The measurement may 
be noisy with the fc-dimensional discrete white noise process ^ N{0, R{ti,'^)), e^, identically distributed 
and independent of W{t). 

The state space notation has an inherent way of parameter estimation by considering the parameter 
vector 5* as a latent state variable. The state vector is augmented by the parameter vector {y ^ y = [^] ) 
with trivial dynamics [d^^ = 0). This leads to the extended state space model: 

dy{t) = f{yit),t,^)dt + giyit),t,^)dWit) 
= 

= h{y{ti),ti) + e^. 

Filtering this state space model results in a sequential estimator of the unknown parameters: ^ = E['i'{t)\Z^] . 
Note, that this approach of parameter estimation leads to nonlinear problems, even if the original state 
equations are linear. Therefore, linear filters like the Kalman filter are inapplicable for this approach. 
Standard nonlinear filters as the extended Kalman filter or the unscented Kalman filter are applicable but 
do not estimate parameters of the diffusion coefficient g{y{t),t,'^), e.g., volatility. 
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2.2. Time and measurement update for Gaussian filters 

Continuous discrete stochastic filters estimate the probabihty density of the state vector from noisy 
observations. The algorithms consist of two steps. There is no measurement information for t All 
information is based on time ti and the density p{y,t\Z^) is obtained by propagating p(jj,ti\Z^) according 
to the dynamic given by the state equation ([T]) . This results in integrating the Fokker-Planck equation (see, 
6-g-j This step is the time update. For t = ii+i, with current measurement information available, the 
a-posteriori (after measurement) state density p{y,ti+i\Z'^~^^) is estimated via the Bayes' theorem using the 
a-priori density p{y\Z'^) of the time update step and the measurement information p(zi^i) and likelihood 
p(zi-|_i This step is the measurement update. Time update and measurement update are recursively 
repeated. Note that both steps can be solved explicitly only for linear systems and Gaussian densities and 
some special cases. Therefore, various numerical and simulation-based methods exist to approximately solve 
both steps (see for example |7j and the reviewing introduction therein). 

A common simple and effective approximation is assuming Gaussian densities although the transforma- 
tions are nonlinear. This simplifies the equations and is done, e.g., by the extended Kalman filter (EKF, 
01) and the unscented Kalman filter (UKF, Q). These Gaussian filters only estimate mean /i and variance 
S of the state densities and are exact for linear problems, i.e., linear transformations of the densities. The 
Fokker-Planck equation of the time update step then reduces to the moment equations 

m = E[fiyit),t)\Z'^] (3) 
tit) = Cov[f,y\Z'] + Cov[yJ\Z'] + E[n\Z% (4) 

with Q = gg' . A dot over the variables denotes differentiation with respect to time. Since these equations 
depend on the conditional densities p{y, t\Z^), they have to be solved approximately. The extended Kalman 
filter uses linear Taylor expansions to approximate the functions / and g, while the unscented Kalman filter 
approximates the densities of f{y) and g{y). The measurement update is simplified using the theorem of 
normal correlation for jointly distributed Gaussian random variables^ Using subscripts i for at time ti and 
for at time i^+i based on information of ti and taking the measurement equation (0) into account with 
hi := h{y{ti)), the measurement update becomes 

= l.M+i\i + Ki+i{zi+i - E[hi+i\Z']), (5) 

= Y.i+i\i - Ki+iCoy[hi+i,yi+i\Z'], (6) 

where 

K,+i = Coy[y,+i,h,+i\Z'](YaT[h,+i\Z'] + i?,+i)- (7) 

is a regression matrix, called Kalman gain, and (•)" denotes the pseudoinverse. Expectations and covari- 
ances have to be approximated. The extended Kalman Filter uses Taylor expansions of the respective 
functions around Hi+m and the unscented Kalman filter uses the unscented transformation to approximate 
the transformed densities (see section |3]). 

Due to the role of the covariance matrices Cov[yi+i, hi+i\Z'^] in equations ^ and ©, only the states 
which are correlated with the measurement are updated. States (or parameters) that are connected to the 
measurement through the diffusion coefficient g, e.g., volatility parameters, are generally not correlated with 
the measurement, i.e., they are not updated/estimated by the filter. In the next section, we extend both 
steps with a higher order correlation component to overcome this problem. 



^Let X and Y be normally distributed random vectors with expectations fix^ IJ-Y a-nd covariances SxXi "^YY ^ "^XY ^ respec- 
tively. Then ^[Xly] = /Jx + ^xy'^yyO^ ~ ^^'^ Var[X|y] = T,xx — ^xy^yy^y x , where is the pseudoinverse of 
T.YY- 
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2.3. Higher order correlation measurement update 

States which are not hnearly correlated with the measurement are not updated via the theorem of 
normal correlation and cannot be estimated by Gaussian filters. In these cases, the corresponding entries 
in the covariance matrix Cov[yi+i, are zero and the prediction error Vi-^-i = (z^+i — |^*]) in 

equation ^ does not provide information for a linear update of these states. However, in most cases the 
squared prediction error vfj^i = {zi+i — hi^i\if' provides additional information on these states. Therefore, 
a measurement update is constructed in this section which also consists of a nonlinear part based on the 
squared prediction error. 

The measurement update in equation ([5]) is closely related to the theory of martingale estimating func- 
tions and quasi-likelihood models for stochastic models. If 6* is a real valued random variable on a probability 
space (O = {w}, J^, 7^), an estimating function for 6* is a function G = g{uj,6) on x R. It is unbiased if 
E-p [g{uj,9{uj))] = 0, where E-p is the expectation over V (see, e.g., Q for a survey). 

For example, let Xi, i — I . . .n he n independent random variables with finite mean mi{9) and vari- 
ance Vi{9). Then E[Xi — mi(9)] — since the sequence Xi — mi{9) is a martingale difference. Since 

E 



{X,-m,{9)Y ^v,{9) 



E 



{X,~m,{B)Y -v,{9) 



^ 



holds in the same way, which yields the estimating functions for 6 



= ^wi,, {X,-m,{9j), 

i=l 
n 

= Ew2,» \{X, - m,{9)f ~ v,{9) 



(8) 
(9) 



where wi/2,i are weights. Together with the observation equation the linear estimating function ([5)) 
directly leads to the linear filter equations ^ and See, e.g., [lo[ or [llj for a derivation. 

Similar to the problem considered in this paper, i.e., the failing of the linear measurement update, there 
are problems, where linear estimating functions of the form ([5]) fail to estimate the value of 9. An early 
reference with examples for this effect is [l^. A solution is to combine equations (|S]) and © to one single 
estimating function 



n 



(10) 



to overcome this problem. Much work has been done to find the new optimal weights w* ^ and j, see, e.g., 
[13, or [l3| for the restriction of orthogonality of equations ([5]) and © and [15| for the general case. 
Wefelmeyer [l^l showed that this estimator ([TUll may be replaced by the one-step estimator 



(X,-me„(X,_i)) 



+Be„ ((X, - me„(X,_i))' - ue„,_i(^0) 



where 0„ is an initial estimator for 9. The weights A, B, C, and / are 



In = {Ae.^^im'e^{X,^i) + Be.,v'f,^{X,^i)) 
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see, e.g., (l5| . The /ii,/i2,/i3 are centered moments, 'mg^{Xi^i) is a predictor of Xi based on 0„ and Xi^i, 
and V0^(Xi^i) is a predictor of the variance of Xi. The primes ' denote derivatives with respect to 9. For 
n = 1, i.e., one observation time and assuming /i3 = for simphcity, i.e., orthogonahty of equations ([5]) and 
0, the weights reduce to: 

where we make the further assumption that the terms {m! {n^ — /i2)^) ^ v' ^2v' respectively (f V2) iTi'(fi4 — 
A*2)^to' are neghgible. In both assumptions we assume that either the hnear information or the quadratic 
information on 9 dominates and not both terms provide the same level of information. 

With the estimator fii+i\i for fii^m^i (notation as is section r2.2l) this yields a new measurement update 
equation for the mean: 

= + Kg\ X + k\% X - E[y}+,\Z']) , (13) 

where i/^+i = (z^+i — E[hi^i\Z^]) is the observation error and Ki and K2 are linear regressor matrices, with 

K^^2i ^ Cov[y,+i, h,+i\Z'] (Var[/i,+i|Z'] + , 
= Cov[y,+i, v^+^\Z^] (Var , 

where the derivatives in equations pT|) and ([T^ are replaced by the respective regressors, i.e., covariances 
divided by variances. Ki and K2 may be referred to as first and second order Kalman Gain, respectively, 
where Ki coincides with the classical Kalman Gain given in equation ([7]). 

The equation for the variance after measurement ^ is extended analogously leading to 

= E,+i,, - ifW Cov[/i,+i,2/,+i|Z^] - if(^\Cov[;.2^i,y,+i|Z^]. (14) 

Note that this approach for the variance is fairly rough. It follows from the assumption of a joint normal 
distribution of squared prediction errors and states. This implies independence of the results of the mea- 
surement and its information content, i.e., only the fact that there is new information and not the value of 
the measurement contributes to the variance update. In the case of diffusion parameters, however, small 
prediction errors are likely for all values of the diffusion parameters, whereas large errors clearly indicate 
large parameters. Therefore, large prediction errors contain more information on the diffusion coefficient 
than small errors. In a more accurate approximation, the value of v would therefore be part of the variance 
update P^ . 

The higher order measurement update equations (jl3p and p4l) contain higher moments of the state 
vector y, i.e., a forth centered moment and the covariance matrix Cav[yi+i,vfj^^\Z^] which contains mixed 
third order moments of the state vector y. Thus, the time update steps of the filter algorithms have to be 
extended by a propagation of the respective information. In the next section, we will discuss how the update 
can be applied within the unscented Kalman filter framework. 

3. Higher order correlation unscented Kalman filter 

The unscented Kalman filter as introduced by Julier and Uhlmann [sl is based on the unscented trans- 
formation which is briefly reviewed in subsection 13.11 The unscented transformation captures the first two 
moments of densities undergoing nonlinear transformations. Since information on higher moments is needed, 
the unscented transformation has to be extended to capture these moments which is done in subsection 13.21 
In subsection 13.31 a higher order unscented Kalman filter is presented. 



(It \2 I , I '\~^ l( 2\ 1-1 

[m — 112) m + V /i2f j m (/X4 — /ij) ~ m = 
[m [fi4 - ^2) m + V j V ^2~ V = 



dm ' 



(11) 
(12) 
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3.1. Unscented transformation 

The unscented transformation is a method for calculating the transformation of the density of a random 
variable which undergoes a nonlinear transformation (see (5|). In the original framework of '5!|, the density 
p{y) of the random variable y gMp is approximated by the sum 

p 

with the Dirac delta function (5, the n = 2p + 1 sigma points y^''\ and the weights a; Furthermore the 
sigma points and weights are chosen such that the first two moments of the density of y are replicated: 



E[y] = j yp{y)dy ^ j ypuT{y)dy 



In contrast to Monte Carlo approaches, this choice is deterministic. Julier and Uhlmann [HI establish the 
following choice: 

y«=i?[j/] + (V(p + «)VarM)^, ^ i = l---p, 

2;(^+^')=.i?M-(Vb + '^)Var[y])^, = 1 = 1- --p, 



y 



(p + k) ' 



where (.^/rT7)i is the i-th row or column of the matrix root. The real parameter n gives an extra degree 
of freedom which is discussed in detail in |5]. For Gaussian densities of y, they recommend p + k = 3. In 
many cases, however, k = works well (see, e.g., I^SJ) which reduces the number of sigma points by one. 
After undergoing a nonlinear transformation y — > /(y), the expectation, the covariance of f{y) and the cross 
covariance of /(y) and y can be computed by: 

n 

E[~f{y)] = ^c.W/(y<-'')), 
i=i 

Var[/(y)] = J] ^'^'^ (/V^'^) - ^^[/(y)]) (/V^') - i?[/(y)]) , 
i=i 

Cov[/(y),y] = ^c.t-') (/(yf^)) - E[~f{y)]) (j/^^' - i?[y]) , (15) 
i=i 

where (...)(...)' denotes the outer product. 



3.2. Higher order unscented transformation 

To capture the state densities with higher accuracy, the set of sigma points of the unscented transforma- 
tion may be extended (see, e.g., Q). In the following, we present an approach based on numerical integration 
rules for exact monomials given by [l3| and [3| to derive an extended set of sigma points and weights. 

Definition 1 (Generator, A point u = (ui, u™, 0, 0) G W where < Ui < Wi+i is called 

generator and denoted as [±u] or [±mi, ...,±Um\. It represents the set of points that can be obtained from u 
by permutations and changing the sign of some coordinates. 
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For example, the generator [0] contains only the point G MP, while the generator [±^,±1] in M'^ is the 
set of points 

{ (1,1,0), (-1,-1,0),!, -1,0), (-1,1,0)(1,0,1), (-1,0,-1), 
(1,0,-1), (-1,0,1), (0,1,1), (0,-1,-1), (0,1,-1), (0,-1,1)}. 

In the sense of fl3|, we write f[±ui, . . . , ±Ur] for the sum X]xg[±mi ±u ] /(^)- 

In generator notation, the unscented transformation of Julies and Uhlmann fs'l with n ~ 2p + 1 sigma 
points and initial iV(0, /) Gaussian densities corresponds to an integration of the form 

/oo 
N{x; 0, 1)fix)dx « Luofm + ^if[±u], 
-oo 

with the weights ujk and the generators [0] and [±-\/3]. 

This integration rule is of precision 3, i.e., it is exact for all monomials of degree 3 or less without any 
odd power. To derive higher accuracy in the filtering algorithm, we use a rule of the form: 

/oo 
N{x; 0, 1)f{x)dx « wo/[0] + wi/[±u] + uj2f[±u, ±u]. 
-oo 

This rule leads to n = + 1 sigma points as we get 1 point from the generator [0], 2p points from [±u] 
and 2p{p — 1) points from [±u,±m]. For u = v^, wq = 1 + ^ ij^ ■: = uj2 — the rule is exact 

for monomials of degree 5 or less without any odd power (see, e.g., [l3| and [13 )■ 

These sigma points capture integration with respect to standard normal Gaussian densities 7V(0, /). 
General Gaussian densities N{fj,, S) are captured by multiplying each sigma point with a square root of the 
covariance matrix S and adding the mean /z. Finally we get the following 2p^ + 1 sigma points and weights: 

y« = E[y] + ■ [±V3p , 

The computation of moments and higher moments of transformed densities is performed analogously to 
equations (|15p . 



a;*-*' = 1 H — — , i corresp. [0], 

18 

o;'-*^ = , i corresp. [±\/3], 

18 

a;'*-* = 7^, i corresp. [±\/3, ±\/3]. 
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3.3. Filter algorithm 

The higher order unscented transformation can be used to evaluate the expectation, variance and co- 
variance terms of the filter equations leading to a higher order unscented filter. The time update is done by 
numerically solving the moment equations ^ and (U) by Euler integration of the sigma point sets. With 
regard to the time continuous nature of the state equations of the system, the Euler scheme may use a finer 
discretization interval St ^ than the measurement intervals t^+i — dividing them into Li = (tj+i —ti)/St 
parts. For notational convenience, we use the function / in the algorithm as an abbreviation of the Euler 
steps. The noise term is discretized on the Euler grid, i.e., dW — )■ AW ~ Af{0,Q) with Q e W^^. The 
p-dimensional state vector is then augmented by the r-dimensional noise terms AW {p — p + r) to 



y = 



V 
AW 



This leads io n — 2{p + r 

m 



1 = 2p^ + 1 (T-points based on 



'E[y] 




und Yjy 



Cov(AM^, y) := 
7 



Cov(?;, AT4^) := 



The noise terms are uncorrelated with the states {Gov[y, AW] := 0). 

After the time update, expectations, variances and covariances are evaluated according to equations psp . 
The measurement update is done by straight forward implementation of the equations (I13p and (jl4p . The 
following algorithm summarizes the resulting filter: 

Algorithm 1 (Higher order correlation unscented Kalman filter (HUKF)). 
Initialization: t = to 

Mo|o = H + Cov[yo,hn] x 

X (Var[/io] + i?o)"Cov[/io, yo] 
So|o = S - Cov[yQ\ho] x 

X (Var[/io] + i?o)"Cov[/io, yo] 
Lo = (t){zo;E[ho],Ya.T[ho] + Ro) 
Sigma points : y^^^ = y^-'^ S), j ^ 1 . . .n; fi = E[yo],T, = Va.T[yo]. 

Recursion: i — 0, T — 1 
Time update: t G 



yi+i\i 



3 = i 



E[h,+,\Z'] = ^t^W/i(/(y«,t„At„AM^/fi))-:/i,+i| 



Var[/i,+i|Z'] = 

Var[j.2^i|Z''] = 

Cov[y,+i,/i,+i|Z*] = 

Cov[y,+i, 1.2^1 = 



Var[/i,+i|Z'] + i?,+i -:i.2^i|, 

Y^J^^ [ At,, Aiy/^) )) - h,+. 



n 



^c.(^) At,, AT4^(^)J -y.+il 

X (h(f{y^^\u,AU,Awll!,)) - 

n 



Sigma points : y{j) = y^^\ni\^, j = 1 
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Measurement Update 

= Cov[y,+i,/i,+i|Z^](Var[/i,+i|Z'']+i?,+i)- 
k'^, = Cov[2;,+i,i.f+i|Z'](Var[i.2^i|Z^])- 



with the notations 

or on finer grid 

{w(j'),yJ.\AW^/^')} see text 

(• • • )^ for vectors is element wise 

/i is a short form for the measurement equation h{y). The subscript j|j denotes at time ti based on 
information at time ti, whereas the subscript denotes at time t^+i based on information at time ti. 

4. Simulation study 

In this section, we compare the presented higher order unscented Kalman filter to the unscented Kalman 
filter in a simulation study. The aim of the study is to show that the higher order terms in the filter equation 
lead to adequate estimates of states, which are correlated as well as uncorrelated to the observations. 

We apply the filter algorithms to the estimation of parameters of an Ornstein-Uhlenbeck process. 
Ornstein-Uhlenbeck processes are regularly used to model commodity prices (see, e.g., |.19j), electricity 
prices (see, e.g., [2Q|) or interest rates (see, e.g., [2l[). A continuous discrete state space representation is: 

dyit) - ^i[^2-y{t)]dt + -^3dWit) 

Zi = y{U) 

with the parameter set — ['Si, ^'2, ^3]. As before, the model consists of a time continuous state equation 
and a discrete measurement equation, where the observations z,; are the prices or interest rates at certain 
points ti in time and y corresponds to a latent underlying process. The parameter ^1 is the mean reversion 
rate, ^'2 is the long run mean of y, and ^3 corresponds to the volatility of the process. For simplicity, we 
suppress all units. Observation intervals ti^i — ti as well as the intervals of the euler grids are set to 1. 
To sequentially estimate the parameter vector as discussed in section [21 we augment the state vector y{t) 
according to y{t) = [y{ty , 5*]' and get an extended state space model 

dyi ^dy ^ y2 [ys - m (t)] dt + ndW{t) (16) 

dm = d*i = 

dys = ^^2 = 

dyt ^d^3 = 

Zi = yiiU)- 

Figures [T] and [5] show sequential estimates of these four state variables, where the true parameter values 
are — [i>i, 5*2, 3] — [0.5, 3, 2]. The filters are initialized with the initial parameters ^0 — [0.5, 3, 2.5] and a 
diagonal initial covariance matrix with standard deviations of 0.1, 0.1, 0.1 for the parameters, respectively. As 
expected, the UKF does not estimate the volatility parameter due to missing correlation between parameter 
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Figure 1: Estimation of the parameter set [Sfi, 'I'2, 'I's] using tlie higlier order cor- 
relation UKF initialized with [0.5,3,2.5]. The true parameter values are [0.5,3,2] 
and denoted by the dashed line. Confidence intervals are \/3 times estimated stan- 
dard deviations to each side, y denotes the realization of the Ornstein-Uhlenbeck 
process. 
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Figure 2: Estimation of the parameter set [y2,y3, y4] using the the UKF initialized 
with [0.5,3,2.5]. The true parameter values are [0.5,3,2] and denoted by the 
dashed lines. Confidence intervals are \/3 times estimated standard deviations to 
each side. The volatility parameter is not estimated, y denotes the realization of 
the Ornstein-Uhlenbeck process. 
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and observations. As expected, the higher order UKF estimates the volatiUty parameter since volatiHty and 
squared prediction error are correlated. The estimation of the other parameters seems unaffected by the 
additional terms in the measurement update. A more detailed analysis of the filters is provided in table 
[T] Again, we sequentially estimate the model, but this time the true parameter values ^' — [5*1, ^'2, ^'3] are 
drawn from a multivariate normal distribution, 

[^-1, *2, ^-3] ^ ([0.5, 3, 2], diag(0.1, 0.1, 0.1)) , 

i.e., the parameters are random but fixed. We simulate observations z^, i = 1, T with T G {10, ... , 1000} 
for A'' = 100000 times, where each of the repetitions has a different parameter set. The observations are 
filtered with initial parameters (mean and covariance) 

*o ([0.5, 3, 2], diag(0.1, 0.1, 0.1)) , 

i.e., the true distribution of the parameters. After T observations we get estimates (mean and variance) of 
each each parameter. 

Table [1] shows the filtering results obtained from HUKF and UKF. Four columns of results are shown 
for each filter. In the first column, the means of the deviations of the N — 100000 estimates from the 
original parameter m(j/^^) = mean[^j — = 1,2,3 are shown. For unbiased estimates, this number 

should fluctuate around zero which is fulfilled in all cases. In the second column, the mean squared errors 
m{i'^ ,) = mean[(5'j — = 1,2,3 are shown. The values should be small. The values of the third 

column are the means of the estimated variances of the filters ■m{a'^^ .) over the N=100000 simulations. 
Matching of the second and third column indicates that the estimates of the variances in the filter equations 
are unbiased. 

The fourth column (wo.gs) presents the proportion of the cases where the absolute deviation of mean 
estimate from the true value is larger than 1.96(T>f^. . For normally distributed estimators, these 
numbers should vary around 0.05. Since the distribution of in particular the volatility parameter '^3 is not 
normal we expect deviations. However, in all cases the deviations are small. 

Both filters show similar results for the parameters and ^'2. However, the results for the volatility 
parameter ^'3 deviate since the UKF does not update ^'3 and its variance after observations. Therefore, 
the third column (UKF) shows the initial value of the parameter variance (0.1) in all cases. The second 
column matches this value since the estimated values correspond to (always the same) initial parameter value 
^0;3 = 2 while the true parameter values ^1^3 are drawn from a distribution with variance 0.1 and mean 
^0;3 = 2. The first column shows values around zero since the initial parameter 5'o;3 = 2 is an unbiased 
estimator of the true parameters (which are drawn from a normal distribution around ^o;3 = 2). However, 
the UKF does not estimate this parameter. Contrary, the HUKF estimates this parameters very accurately 
and shows only a small bias for very small sample sizes which vanishes for sample sizes larger than T = 100. 
The estimated variances match the mean squared error and decrease rapidly with growing value of T. 

5. Conclusion 

This paper addresses sequential estimation of parameters and states in nonlinear state-space models in 
which the parameters are not correlated with the observations, as, for example, volatility. It is focused on 
nonlinear, Gaussian stochastic filters, i.e., filters that reduce all densities to their first two moments. These 
filters, like the extended Kalman filter or the unscented Kalman filter, are fast filters and give good results 
in many cases. However, when there is no correlation between states and observations, they fail to estimate 
the respective states and parameters. In this paper, higher order correlation filter equations are introduced, 
that are also based on Gaussian densities but do not rely on linear correlations. The approach shows large 
advantages over simulation-based filtering methods with respect to computing costs. Filter algorithms for a 
higher order unscented Kalman filter are explicitly stated. The filter is validated in a simulation study. 
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.051 




.053 


.043 


-.003 
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.023 
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.018 
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.008 


.048 


.001 
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.100 
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-.002 


.004 




.004 


.047 


-.000 


.100 


.100 


.050 






1000 


-.001 


.002 




.002 


.049 


-.001 


.100 


.100 
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Table 1: Sequential estimation of the three parameters ^j, J = 1,2,3 of model III6II with the higher order filter (HUKF) and 
the unscented Kalman filter (UKF) for different length T of time series. The estimates are repeated A'^ = 100000 times with 
true parameter values drawn from a multivariate normal distribution. Shown are the mean errors of estimation (m(i^^j)), 

mean squared errors of the estimates {m{u^ )), the mean of the estimated parameter variances (m((T^^j)) and the empirical 

exceedance ratios (mo. 95) of the estimated confidence intervals of the parameters (1.96 standard deviations to both sides). 
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